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SUMMARY 


Problem 


A  research  effort  was  conducted  at  the  Navy  Personnel  Research  and 
Development  Center  (NAVPERSRANDCEN)  using  the  ECHO  technique,  a  technique 
for  attitude  assessment  in  which  responses  are  elicited  to  repeated  open- 
ended  questions.  During  the  course  of  this  research,  it  was  found  that 
the  existing  computer  programs  available  for  analyzing  ECHO  and  other 
categorized  data  were  inadequate  for  carrying  out  the  proposed  analysis 
plan. 

Objective 

The  objective  of  the  present  effort  was  to  develop  an  efficient 
computer  program  to  analyze  the  data  generated  by  use  of  semi-structured 
data  collection  techniques  such  as  ECHO. 

Approach 

A  review  of  software  packages,  both  those  developed  at  NAVPERSRANDCEN 
and  those  available  commercially,  revealed  that  no  program  existed  which 
satisfied  the  project  requirements.  Therefore,  the  development  of  a  com¬ 
prehensive  computer  program  was  undertaken.  This  program  was  designed  to 
satisfy  the  data  analysis  requirements  of  the  ongoing  ECHO  project  and  to 
be  useful  to  other  Investigators  employing  open-ended  question  techniques 
such  as  ECHO. 

Results 


A  computer  program  called  GURU  was  developed.  The  program  provides 
extensive  descriptive  statistics  of  categorized  data  and  allows  great 
flexibility  in  comparing  various  groups  of  respondents  as  well  as  differ¬ 
ent  classifications  of  the  same  data. 
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INTRODUCTION 


Problem 


A  research  effort  was  conducted  at  the  Navy  Personnel  Research  and 
Development  Center  (NAVPF.RSRANDCEN)  using  the  ECHO  technique,  a  technique 
for  attitude  assessment  in  which  responses  are  elicited  to  repeated  open- 
ended  questions.  During  the  course  of  this  research,  it  was  found  that 
the  existing  computer  programs  available  for  analyzing  ECHO  and  other 
categorized  data  were  inadequate  for  carrying  out  the  proposed  analysis 
plan. 

Background 

The  ECHO  technique  is  a  semistructured  data  collection  technique  used 
to  observe,  quantify,  and  describe  ".  .  .  the  patterns  of  value  and  in¬ 
fluence  that  are  felt,  verbally  expressed,  and  often  acted  on  in  human 
society"  (Barthol  &  de  Mille,  1969). 

Basically  the  ECHO  technique  involves  giving  each  respondent  20  computer 
cards,  each  containing  two  questions.  One  of  the  questions  is  designed  to 
reveal  how  the  respondent  feels  about  some  aspect  of  the  topic  being 
studied;  that  is,  it  requests  him  to  list  some  event  that  indicates  his 
value  judgment  dealing  with  some  area  being  studied.  The  second  question 
is  designed  to  reveal  who  would  be  concerned  with  that  aspect  or  judgment. 
Each  card  is  prepunched  to  distinguish  the  respondent,  the  group,  the 
specific  card,  and  the  specific  questions  asked.  However,  the  personal 
identity  of  the  respondent  is  not  provided. 

Ten  of  the  cards  are  called  "goods,"  because  the  questions  they  con¬ 
tain  are  of  a  positive  nature.  For  example,  a  "good"  might  ask:  "What 
is  a  good  thing  to  do  on  the  job?"  and  "Who  would  approve?"  The  other 
10  are  called  "bads,"  because  of  their  negative  nature.  For  example, 
a  "bad"  corresponding  to  the  "good"  above  would  ask:  "What  is  a  bad  thing 
to  do  on  the  job?"  and  "Who  would  disapprove?"  Subjects  are  asked  to 
answer  each  question  on  the  card  and  to  supply  demographic  information.1 

The  responses  obtained  are  inductively  categorized  by  a  team  of  three 
people.  The  responses  to  "goods"  and  "bads"  are  analyzed  separately  under 
specified  sets  of  procedures  that  sometimes  require  a  number  of  classi¬ 
fication  iterations  before  reliability  is  achieved.  Finally,  the  classi¬ 
fication  codes  are  inserted  into  the  computer  cards  and  the  data  are  pro¬ 
cessed  through  such  computer  programs  as  UNIKOUNT,  PERZPROB,  and  ROLLET1. 
These  programs  provide  (1)  lists  of  response  category  hierarchies,  (2) 
tables  comparing  groups,  and  (3)  a  comparison  of  the  classification 
schemes  (Barthol  &  de  Mille,  1969). 


'See  Barthol  and  de  Mille  (1969)  and  Barthol  and  Bridge  (1968)  for 
a  more  complete  description  of  the  ECHO  technique. 
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Objective 

The  present  effort  was  undertaken  to  develop  an  efficient  computer 
program  to  analyze  data  generated  by  use  of  semistructured  data  collection 
techniques  such  as  ECHO.  It  was  felt  that  a  computer  program  should  be 
designed  and  developed  that  would  (1)  reduce  the  data  card  handling  and 
operator  time  required  for  analysis,  C2)  allow  for  user  flexibility  with 
regard  to  both  input  format  and  desired  output,  and  (3)  accommodate  cate¬ 
gorized  data  generated  by  unstructured  or  semistructured  methods  other 
than  the  ECHO  technique. 
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APPROACH 


To  fulfill  the  objective,  a  search  was  conducted  to  determine  if  an 
adequate  computer  program  was  available.  Both  NAVPERSRANDCEN  resources 
and  those  programs  available  commercially  were  examined.  Since  an  adequate 
program  was  not  found,  the  development  of  a  comprehensive  computer  program 
was  undertaken.  This  program  was  designed  to  satisfy  the  data  analysis 
requirements  of  the  ongoing  ECHO  project  and  to  be  useful  to  other  inves¬ 
tigators  employing  open-ended  question  techniques  such  as  ECHO. 
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RESULTS 


A  computer  program  called  GURU  was  developed  for  use  in  research  pro¬ 
jects  where  responses  are  generated  by  unstructured  or  semistructured 
methods  and  then  organized  through  content  analysis  or  categorization. 

It  is  designed  to  run  on  the  PL/I  OPTIMIZING  COMPILER  using  the  IBM  360/ 
65  monitor. 

Potential  Applications 


The  use  of  a  structured  data  collection  technique,  such  as  a  forced- 
choice  survey  or  a  checklist,  is  appropriate  when  an  investigator  has 
knowledge  about  the  relevant  variables  in  the  population  being  studied. 

With  such  knowledge,  he  can  ask  the  right  questions  to  assess  the  saliency 
of  these  variables  with  respect  to  a  particular  topic  area. 

However,  for  exploratory  research,  unstructured,  open-ended  or  semi¬ 
structured  techniques  such  as  ECHO  are  more  appropriate  because  they  pre¬ 
sume  that  the  investigator  has  limited  knowledge  about  the  relevant  vari¬ 
ables  of  a  particular  population.  Such  techniques  can  be  used  to  assess 
the  values  or  basic  attitudes  held  by  members  of  the  group  by  asking 
the  following  kinds  of  questions: 

1.  What  benefits  have  you  obtained  from  the  training  program? 

2.  Why  did  you  decide  to  leave  (or  join)  the  Navy? 

3.  If  you  have  objections  to  women  serving  aboard  ship,  what  are  they? 

4.  What  aspects  of  your  job  motivate  you  to  produce  to  full  capacity? 

5.  What  turns  you  off  or  demotivates  you  from  accomplishing  any¬ 
thing  on  the  job? 

In  research  projects  where  responses  are  generated  by  unstructured 
or  semistructured  methods  and  then  organized  through  content  analysis 
or  categorization,  the  GURU  computer  program  described  herein  is  useful 
in  gaining  quantitative  insights  into  the  data  obtained  by: 

1.  Counting  responses  in  each  category  of  classified  data,  computing 
percentages  and  ranks,  and  printing  tables  of  rank-ordered  categories  with 
their  titles. 

2.  For  any  subgroup  in  the  population,  computing  and  presenting  the 
frequency,  percentage,  and  rank  of  each  category  and  then  computing 

the  difference  between  groups  with  the  corresponding  probabilities. 

3.  Printing  and/or  punching  the  written  responses  occurring  in  tiie 
Individual  categories. 

4.  Comparing  different  categorizations  of  the  same  data. 
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This  means  that,  from  any  data  base,  the  user  could  (1)  print  tables  of 
rank-ordered  categories  with  their  frequencies,  percentages,  and  titles, 

(2)  print  the  actual  responses  listed  by  category  and  (if  desired)  provide 
this  listing  on  subsamples  within  the  total  sample,  (3)  compute  the  possi¬ 
bility  of  observing  different  percentages  between  the  groups  in  each  of 
the  categories,  and  (4)  if  the  responses  were  classified  two  or  more  times, 
see  how  the  responses  in  one  category  scheme  are  distributed  over  a  second 
set  of  categories. 

If,  for  example,  the  work  motivation  of  employees  was  being  investigated 
by  asking  open-ended  questions,  the  responses  to  the  questions  would  be 
categorized  and  the  classification  codes  entered  on  input  data  cards.  The 
GURU  program  would  provide  descriptive  statistics  (i.e.,  the  frequency 
and  percentage  of  responses  in  each  category,  lists  of  response  category 
hierarchies ,  and  lists  of  classified  responses).  In  this  example,  the 
program  might  show  that  (1)  40  percent  of  the  respondents  find  supervision 
is  demotivating  them  from  contributing  to  tiieir  full  capacity,  (2)  20  per¬ 
cent  object  to  the  plant  facilities  (inadequate  lighting,  cold  buildings, 
etc.),  or  (3)  25  percent  are  satisfied  witli  the  pay  they  receive.  The 
investigator  is  observing  values  and  attitudes  emitted  from  a  population 
in  response  to  open-ended  questions.  The  descriptive  information  supplied 
by  the  program  provides  an  index  of  the  saliency  of  these  feelings.  The 
program  would  also  allow  the  user  flexibility  in  comparing  various  groups 
of  respondents.  For  example,  if  the  sample  included  first-level  super¬ 
visors,  as  well  as  workers,  the  investigator  could  analyze  data  from  the 
two  groups  separately,  look  at  the  similarities  and  differences  in  their 
respective  response  patterns,  and  print  the  actual  responses  occurring 
in  each  category  under  the  appropriate  category  title.  This  permits  i 
inspection  of  the  raw  data  in  its  categorized  form.  This  option  can  be 
especially  effective  in  providing  individuals  (e.g.,  management),  a  clear 
picture  of  the  data  making  up  each  of  the  categories.  Additionally,  if 
more  than  one  data  categorization  is  conducted,  ei"hp.  by  using  the  same 
category  scheme  or  generating  new  categories,  the  user  can  compare  the  two. 

Tliis  program  also  has  potential  utility  in  the  area  of  training 
research.  In  addition  to  obtaining  objective  performance  criteria,  an 
investigator  may  desire  to  incorporate  a  trainee's  subjective  reactions 
into  tlie  evaluation  of  a  new  training  program.  Tims,  after  training  is 
complete,  a  trainee  could  be  asked  to  respond  to  an  open-ended  technique 
of  gathering  information  sucli  as  writing  a  paragraph,  listing  remarks, 
responding  in  an  interview,  etc.,  to  tap  their  attitudes/reactions  to  the 
training.  Once  this  information  has  been  collected,  it  can  be  categorized 
as  previously  described  in  the  worker  attitude  example.  At  this  point  the 
computer  program  can  provide  the  descriptive  statistics,  as  well  as  the 
options  of  (1)  making  group  comparisons,  (2)  listing  the  actual  responses, 
and  (3)  providing  classification  comparisons. 

Program  Description 

The  GURU  program  is  comprised  of  four  major  components:  (1)  UNIKOUNT, 
(2)  PER7.PR0B,  (3)  PRINT/PUNCH,  and  (4)  CATCOKP .  These  components  are 
described  in  detail  in  the  following  paragraphs. 
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unikount 


As  described  in  Barihol  and  do  Mi  lie  (1969),  "program  UNIKOUNT 
counts  responses  in  each  category  and  respondents  having  at  least  one 
response  in  each  category."  A  sample  output  is  provided  in  Appendix  A. 

UNIKOUNT  aggregates  all  occurrences  of  each  category  number  punched 
in  a  specified  reference  field  in  the  data  cards  to  arrive  at  the  response 
frequency  for  each  category  (one  card  per  response).  The  program  computes 
the  percentage  that  each  response  frequency  represents  of  the  total  number  of 
responses  in  the  analysis.  The  categories  are  ordered  by  card  frequency 
and  rank  numbers  art  computed  with  rank  number  1  (or  lowest  tie  number) 
being  assigned  to  the  category  having  the  most  cards.  A  rank-ordered 
table  is  printed,  with  each  row  showing  the  category  number,  card  fre¬ 
quency,  percentage,  rank  number,  and  title  for  that  category  (see  page 
A-4  and  A-5) .  The  output  also  includes  the  following:  (1)  main  title 
describing  the  input  data  (e.g. ,  "WAREHOUSEMEN"),  (2)  title  of  the  ref  ice 

field,  (3)  total  number  of  data  records  input,  (4)  number  of  error  car 
any  (i.e.,  cards  having  no  category  number  punched  in  the  specified  re  .nee 

field),  and  (5)  cards  remaining  in  the  analysis  after  deletion  of  the  error 
cards . 


The  second  UNIKOUNT  table  shows,  for  each  category,  the  number  of 
respondents  (subjects)  having  at  least  one  response  in  that  category  (see 
pages  A-6  and  A-7).  To  arrive  at  this  information,  UNIKOUNT  excludes  any 
redundant  responses  for  each  subject  in  the  category;  that  is,  one  response 
is  excluded  every  time  the  unique  identification  number  of  the  respondent 
is  repeated  in  one  category.  The  program  computes  the  percentage  that  the 
resulting  frequency  represents  of  the  total  number  of  respondents  found  in 
the  sample,  and  displays  these  percentages  in  a  table  similar  to  the  first 
UNIKOUNT  table.  The  number  of  redundant  responses  excluded  is  also  noted. 

PERZPUOB 


The  PERZPROB  option  (see  Barthol  &  de  Mille,  1969)  computes  the 
probability  of  observing  different  percentages  of  two  groups  of  respondents 
or  response  cards  represented  in  a  category.  A  sample  output  is  provided 
in  Appendix  B. 

The  program  prints  a  percentage  table  (pages  li— !  and  B-2)  showing 
categories  as  rows  and  groups  of  respondents  or  response  cards  as  columns. 
Each  (i'll  of  the  table  shows  the  percentage  of  one  group  of  respondents 
(subjects)  or  response  cards  that  were  represented  in  one  category. 

It  also  prints  a  probability  table  (pages  B-3  and  B-4)  showing  t  In- 
computed  probabilities  that  a  percentage  difference  as  large  as  the  observed 
difference  between  each  pair  of  groups,  in  each  category,  might  have  arisen 
by  chance.  Since  the  different  groups  do  not  include  the  same  respondents, 
the  percentages  or  proportions  are  independent,  and  the  s-mpling  distribu¬ 
tion  of  the  difference  between  two  proportions  is  approx  mately  normal. 
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The  formula  tor  computing  Z,  the  normal  deviate  (Wallis  &  Roberts, 
1  Si  5b)  Is: 

?  _ _ P1  ~  P2 

/ (PQ)  (T/n~  +  l/n2) 


wliere 


P 

1 

I> 

2 


n 

1 


n, 


P 


the  proportion  of  one  group  represented  in  the  category, 
the  proportion  of  the  other  group  represented  in  the  category, 
the  number  of  respondents  in  the  first  group, 
the  number  of  responuents  in  the  second  group, 

n 1 P1  +  n2p2 


and 

Q  =  1  -  P. 


Since  the  possible  values  of 


p0  are  not  continuously  distributed, 


a  correction  for  discreteness  is  applied  before  the  probability  is  computed, 
reducing  by  0.5  the  frequency  corresponding  to  the  larger  proportion 
(n^pp  and  increasing  by  0.5  the  frequency  corresponding  to  the  smaller 


proportion  (n7p0) . 


In  certain  instances,  the  user  may  have  an  extreme  j)  or  or  small  X. 
Thus,  the  program  follows  this  rule:  £  or  cj_,  whichever  is  smaller,  multiplied 
by  or  t{ 0 ,  whichever  is  smaller,  must  result  in  a  product  greater  than 

5  (Downey  &  heath,  1974).  If  the  product  is  less  than  or  equal  to  5,  the 
program  prints  zeros  followed  by  an  asterisk  wh<  T'e  the  probability  would 
normally  be  printed.  Tiiis  is  done  because  computing  Z  on  comparisons  where 
this  product  is  5  or  less  is  inappropriate.  The  Pisher's  exact  test  is  the 
correct  test  for  probability  with  such  comparisons  (hays,  1963).  Package 
computer  programs  are  available  for  this  test.  However,  if  the  product  is 
greater  than  5,  the  program  computes  and  prints  the  probability  associated 
with  that  comparison.  Each  value  in  the  probability  table  is  two  tailed, 
including  the  probability  of  the  difference  p^  >  p9,  as  well  as  the  differ¬ 
ence  p2  >  pj. 
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Additional  information  associated  with  the  percentage  table  includes: 
(1)  main  title  (e.g.,  "WAREHOUSEMEN"),  (2)  number  and  title  of  each  cate¬ 
gory,  (3)  number  and  title  of  eacli  group,  and  (4)  the  number  of  respondents 
or  response  cards  in  each  group. 

Each  column  in  the  probability  table  compares  two  groups  in  the  per¬ 
centage  table.  The  difference  associated  with  the  comparison  is  lollowed 
by  the  probability.  After  selecting  a  suitable  probability  1 evel  (for 
example,  0.50,  0.010,  0.001),  tiie  user  can  circle  the  cells  in  the  prob¬ 
ability  table  that  meet  that  criterion.  The  circled  cells  then  constitute 
an  index  of  significant  differences  in  the  percentage  table. 2 

The  input  data  for  PERZPROB  are  the  percentages  printed  in  the 
UNIROl'NT  table.  Thus,  UNIKOITNT  must  be  run  to  obtain  PERZPROB  output. 
However,  there  is  an  option  in  GURU  to  suppress  the  printing  of  the  UN'IKOUN’i 
tables . 

PRIST/PUNCH 

GURU  is  capable  of  printing  or  punching  individual  verbal  responses. 
For  example,  if  responses  were  categorized  and  the  user  wanted  to  see  the 
actual  responses  represented  by  the  frequencies  and  percentages  displayed 
in  UNIKOUNT,  the  verbal  responses  could  be  listed  by  category  (see  Appendix 
C).  This  option  is  particularly  useful  when  the  user  is  interested  in  seeing 
the  raw  data  displayed  in  classified  form.  Punching  may  be  useful  if  the 
user  wants  to  create  a  new  data  set  for  reclassification. 

CATCOMP 


CATCOMP  allows  the  user  to  compare  two  or  more  classifications  of 
the  same  data.  The  program  provides  a  listing  of  how  the  responses  in  each 
category  of  one  classification  are  distributed  among  tiie  categories  of  one  or 
more  other  classifications  and  prints  tiie  associated  category  titles,  fre¬ 
quencies,  and  percentages  for  eacii  (see  sample  output  in  Appendix  D) . 


'’it  has  been  stated  that  the  group  percentages  are  independent  because 
tiie  different  groups  do  not  contain  the  same  respondents.  This  means  that 
any  entry  in  tiie  probability  table  refers  to  a  comparison  of  two  independent 
percentages.  A  word  of  caution  is  necessary,  however,  when  the  user  circles 
two  or  more  entries  in  the  same  column  of  tiie  probability  table.  The  degree 
of  dependence  of  entries  in  the  same  column  is  indeterminate  but  should 
not  be  assumed  to  he  zero.  Conditions  tending  to  introduce  dependence  are: 
(1)  each  respondent  may  be  represented  in  several  cells  of  the  column,  (2) 
each  respondent  is  allowed  a  limited  number  of  responses,  which  may  be 
distributed  across  a  larger  number  of  cells,  and  (3)  the  cells  themselves 
may  be  assumed  not  to  exist  prior  to,  or  independent  of,  classification  of 
the  responses  into  categories.  Because  of  this  intracolumn  dependence,  the 
user  should  not  assume  a  simple  correspondence  between  the  number  of  cells 
circled  in  a  column  in  t lie  probability  table  and  the  degree  of  dissimilarity 
between  two  groups  of  respondents. 
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It  is  also  capable  of  comparing  classification  schemes  within  subsamples. 

For  example,  an  investigator  may  have  a  set  of  1000  responses  that  have 
been  independently  categorized  by  two  classifiers  producing  separate 
classifications  and  C^.  If  and  vrere  comprised  of  20  and  26  cate¬ 
gories  respectively,  the  user  might  be  interested  in  empirically  comparing 
the  classif ications  to  get  an  index  of  the  reliability  between  the  two  class¬ 
ifications  or  to  study  category  formation/classification  strategies  of  the 
different  classifiers.  The  CATCOMP  program  will  take  a  particular  classi¬ 
fication  and  show  by  category  how  the  responses  within  each  of  the  cate¬ 
gories  are  distributed  over  another  set  of  categories  C^.  It  could  show 
that  the  responses  in  category  3  for  all  occurred  in  category  8  for 
or  that  the  50  responses  in  category  5  for  were  distributed  over  several 
categories  in  C^.  A  comparison  of  the  corresponding  category  titles  of  the 

two  classification  schemes  could  provide  the  investigator  useful  informa¬ 
tion  about  what  the  classifier  in  called  the  same  responses  classified 

by  the  classifier  in  C^.  Information  concerning  classifiers’  perceptions 

and  level  of  abstraction  during  categorization  may  be  derived.  Additionally, 
the  investigator  could  make  similar  comparisons  between  classifications  on 
specif ied  subsamples  of  the  population  of  responses.  Such  a  comparison 
is  demonstrated  in  Appendix  D. 

Input  Data  Format 

The  GURU  program  reads  fixed  length  data  records  of  any  size.  The 
columns  for  each  classification  must  be  the  same  on  each  record.  Each 
subject  can  have  any  number  of  classification  records.  This  would  be  used 
if  a  subject  was  asked  more  than  one  question  or  a  single  question  more 
than  one  time. 

If  the  option  to  PRINT/PUNCH  the  actual  verbal  responses  is  chosen, 
those  responses  should  be  on  fixed  length  data  records  of  any  size  (not 
necessarily  the  same  record  size  as  the  classification  records),  in  a 
separate  file.  An  identification  field  of  consecutive  columns  must  appear 
on  both  the  classification  cards  and  the  verbal  response  cards.  However, 
these  fields  need  not  be  in  the  same  columns  on  both  types  of  records. 

Capabilities  and  Limitations 

The  capabilities  and  limitations  of  the  GURU  program  are  listed  below: 

1.  GURU  allows  909  runs  to  be  made  in  a  single  submittal. 

2.  Run  title  cards  allow  for  an  80-character  description. 

3.  There  may  be  up  to  10  groups  per  run. 

4.  Group  title  cards  allow  for  a  40-character  description. 
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‘j.  There  may  be  20  numeric  values  ior  each  group. 

().  The  program  can  group  nonconsecut  ive  values. 

7.  The  reference  field  title  may  be  up  to  40  characters. 

8.  If  there  is  more  than  one  classification  record  per  subject,  those 
records  must  be  located  together  on  the  input  file,  not  necessarily  in 

any  sorted  order.  This  requirement  applies  when  the  percent  subjects 
option  is  chosen. 


The  necessary  job  control  language  to  execute  GUKT  is  provided  in 
Appendix  h.  A  sample  of  a  source  listing  is  provided  in  Appendix  F. 
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LIKING  PARTICULAR  PARTS  OF  THE  JOB 


category  Raw  frequency  percent  subjects  rank  category  title 


26.0  DOESN’T  ANSWER  QUESTION 


BROADWAY  WAREHOUSEMEN 
ANNEX  WAREHOUSEMEN 


B-2 


T«0. T AILED  PROBABILITY  OF  A  DIFFERENCE  OF  THIS  MAGNITUDE  ARISING  BT  CHANCE 
(•  INDICATES  THAT  ThE  PROBABILITY  TEST  FOR  THIS  COMPARISON  IS  INAPPROPRIATE  I  SEE  OOCUMENTATlONM 


APPENDIX  C 

PRINT/ PUNCH  SAMPLE  OUTPUT 


C-0 


BiQAOMAX-WmttOUSEBC&l 


TO  ACCOMPLISH  THE  JOB 
NO  COMMENT 


1 


APPENDIX  D 


CATCOMP  SAMPLE  OUTPUT 


CATEGORY  COMPARISONS  FOR  -  SAMPLE  RUN  OF  GURU  PROGRAM 
REFERENCE  FIELD  •  INOIGENOUS  CLASSIFICATION 


H  <V 


CATESj =>T  COMPARISONS  FOR  -  SAMPLE  RUN  OF  GURU  PROGRAM 
REFERLNCt  FIELD  -  INOIGENOUS  CLASSIFICATION 


CATC60RT  COMPARISONS  FOR  -  SAMPLE  RUN  OF  GURU  PROGRAM 
REFERENCE  FIELD  -  INDIGENOUS  CLASSIFICATION 


PROFESSIONAL  CLASSIFICATION  FREQUENCY  «  CaTESORT  31  «  CATC60RV  31 

CATEGORY  NUMBER  ANO  TITLE  IN  GROUP  1 
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APPENDIX  K 


GURU  JOB  CONTROL  LANGUAGE 


GURU  JOli  CONTROL  LANCUAGE 


The  necessary  job  control  language  to  execute  GIT. I'  follows: 

a.  //GO  EXEC  POM-Ol’Rl' ,  REGIGG=243K, 

//  parm*’ isa(igor)  ’ 

b.  //GO. STEP i. IB  00  DSA=EC;iO, DISPOSER 

o .  / /CO. S VS IX  DO  * 

[SETUP  CAROS j 

d.  //GO . SYSPRIXT  DD  SYSOUT=A 

e.  //GO.PNCH  DD  SYSOUT=B,DCB=BLKSIZI>30 

f.l.  //GO. TEMPI  DD  DSN=&&TEMP1, 

//  UNIT=SYSDA,DISP=NEW, 

//  SPACE* (6000, (15,10))  , 

//  DCB*(RECFM=FB,LRECL=100,BLKSIZE=6000) 

Supply  as  many  of  card  f.l's  as  the  largest  number  of  groups 
in  any  of  the  runs.  E.G.  for  3  groups,  TEMPI,  TEMP2  &  TEMP 3 
need  be  supplied. 
f.2.  / /G0.TEMP3  DD  DUMMY 

Supply  f.2’s  through  TEMP10  following  the  f.l’s.  E.G.  if 
the  maximum  number  of  groups  is  3,  there  should  be  3  f.l  cards 
and  7  f.2  cards. 

g.  //GO. WORDS  DD  file  containing  verbal  response  records 

h.  //GO. CARD 1  DD  file  containing  classification  records 

i.  //CO.PLIDUMP  DD  SYS0UT=A 

j  •  /* 


E-i 


SETUr 


The  following  is  the  general  setup  for  GURU. 

'll  numbers  are  right-justified. 

Card  A 

Col  27-29  Number  of  runs  to  oe  made. 

C  a  ra  B 

Col  -*0— *4  3  Log  tea  1  record  ieagte.  ■  ’  r  euassi.  nitron  rocores. 

.'art!  C 

Col  24-25  Classification  record  identification  column. 

Card  D 

Col  23-24  Classification  record  identification  width.. 

Card  E 

Col  23-25  Subject  number  column. 

Col  49-50  Subject  number  width. 

Card  F 

Col  15  Flag  for  existence  of  verbal  response  records. 

0  -  No 
1  -  Yes 

[CARDS  G-K  REQUIRED  ONLY  IF  CARD  F  IS  1] 

Card  G 

Col  37-40  Logical  record  length  for  verbal  response  records. 
Card  H 

Col  25-27  Verbal  response  record  identification  column. 

Card  I 

Cel  24-25  Verbal  response  record  identification  width. 
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Card  J 


Col  29 

Card  k 

Free  Format 

Card  l 

Col  1-80 
Card  M 

Col  31-32 
Card  N 

Col  22-23 
Card  0 

Col  43-47 
Card  P 

Col  1-80 
Card  Q 

Free  format 


Card  R 

Col  1-80 


Number 

of  verbal  response 

sections 

(«S.  g-  , 

if  respons 

are  on 

two  cards  with  an 

ID  in  col 

72-30, 

the  number 

sec t io: 

:S  would  be  2,  C.il'C 

I ,  co Is  1 

-72;  Co 

rd  2,  cols 

Verbal  response  sec  cions  followed  by  their  lengths  (e.  g. 
in  the  ex_r.pl e  for  car.:  J ,  card  G  would  have  160,  card  J 
would  have  2,  and  card  k  ■....•eld  have  1  72  31  72). 

Title  card  for  the  run. 

Number  of  groups  for  this  run. 

Number  of  categories  to  be  found  in  the  reference  field. 

Maximum  number  of  records  expected  for  any  one  group. 

"Number  of  group  values  to  follow" 

Number  of  values  per  group  (e.g.,  if  there  are  4  groups 
in  a  run,  4  numbers  would  appear  on  this  card,  separated 

by  one  or  more  blanks). 

"Group  values  to  follow" 
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Card(s)  S 


Free  format  Actual  group  values.  One  card  S  must  appear  for  each 

group.  Values  appear  on  the  cards  separated  by  a  blank 
(e.g. ,  if  there  are  four  groups  for  tills  run  four  S 
cards,  containing  the  values  in  each  group,  must  he 
3ug p 1 led  i  • 

Card  T 

Col  1-80  "Croup  titles  (40  characters)". 

Card(s)  U 

Col  1-40  Group  title.  One  card  U  must  appear  for  each  group. 

Card  V 

Col  14-16  Starting  column  of  the  variable  used  to  split  groups. 

Col  33-35  Width  of  that  variable. 

Card  W 

Col  1-80  "Category  values  and  titles  to  follow" 

Card (s)  X 

Col  6-7  Category  number. 

Col  17-80  Category  title.  These  categories  are  determined  by 
the  classifiers.  There  should  be  the  sane  number  of 
X  cards  as  specified  on  card  N. 

Card  Y 

Col  7  Print  switch.  If  any  printing  or  punching  of  the 

actual  responses  is  desired,  this  switch  should  be 


set  to  one. 


APPENDIX  F 
GURU  SOURCE  LISTING 


F-0 


Col  17 


Pun.:  h 


11  the  responses  aru  to  be  punched 


Col  26 

C  ird  2 

Col  9 

Col  20 

Col  34 

Col  50 

Col  67 

Card  <\A 

Col  21 

Card  BB 

Col  24-63 
Card  CC 

Col  24-27 


.v.:  itch. 

on  cards,  this  switch  should  be  sot  to  one. 

List  switch.  If  the  actual  responses  are  to  be  printed, 
this  switch  should  be  sat  to  one. 

Compute  switch.  If  UNIKOUNT  or  PER2PS0B  is  to  be 
executed,  this  switch  should  be  set  to  one. 

If  the  UN IKCCbT  output  for  the  percent  responses  is 
desired,  this  column  should  be  set  to  one. 

If  the  UNIKOUNT  output  for  the  percent  subjects  is 
desired,  this  column  should  be  set  to  one. 

If  only  UN I  KOI’ NT  is  to  be  performed,  this  column  should 
be  set  to  one. 

If  PERZPROB  is  being  performed,  in  order  to  get  the 
UN IKOU NT  output  as  well,  this  column  should  be  set 
to  one. 

Flag  for  category  comparisons. 

0  -  No 
1  -  Yes 

Reference  field  title. 

Reference  field  column. 
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Card  DO 


Col  2J-25  Reference  field  width. 

[CARDS  LE-riH  A-t-QC  I  K_4D  ONLY  IF  CAiLD  AA  =  1] 

a.  rd  Lc 

Col  45  Number  of  other  class  if icat ions  to  be  c^-jared  vita 


Card  FF 

Coi  59-ul  Maximum  number  of  categories  in  other  class  if ications . 

Card  CG 

Col  1-40  Classification  tide 

Col  41-44  Classification  starting  column 

Col  45-48  Classification  width 

Col  49-51  Number  of  categories  in  this  classification. 

Card  HH 

Col  6-7  Category  number. 

Col  17-80  Category  title.  There  should  be  the  same  number  of  HH 
cards  as  specified  in  cols  49-51  of  the  GG  card. 

NOTE: 

1.  Tl^re  should  he  the  same  number  of  FF  cards  as  specified  in  coi 
45  in  card  EE. 

2.  There  should  be  the  same  number  of  cards  L-HH  as  specified  in 
card  A. 
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PL/I  OPTIMIZING  COMPILER  (SU6SCRIPTRAN6EI 
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ELEMENTS  IN  AN  ARRAY  */ 


X 

3T 

O 


O 

Z 


a 

o 


O' 

<N 


-  X 

•-I  fl  II  If  < 
3 

J  •-*  "5  -C  V) 
O 

O  O 

2 


*3**3 


<X)  O'  O  r-»  <M 

0*  IT  o  c  c 

f\J  <M  *0  K)  IO 


Z 

r 


o 

x 

CL 


o 

o 


Q 

a 

o 

LD 

Z 

c 

2 

UJ 

u 

V) 

UJ 

e 

z 

s 

•c 

* 

*7 

</) 

>- 

< 

X 

X 


a 

UJ 


V)  +•*  •* 

cc  ~ 

UJ  ^ 

—  sfi  ■ 


D 

O 

U 

U 

o 

X 

X 


*  X 
« 

x  x  ; 

*  o 

*  ~ 
>-  •  i 
“7  —  i 

~  c  ; 

_i  -i 
u  o 
o  o 


o 


o 

o 


LJ  "  *H 

X  •-*  r-*  ►h 

—  *-  ++  ►<  — 

»-  - r 

*-■  ~  “7  X  * 

X  H  •  • 

-J  —  — 

~  —  r 

O  -  “7  X  — 

•KHII - CM  — 

«  A  X  X  X  ++ 

-*  ♦  <Z  <  < 

--333  II 
>-**-«  1/5  </)  Cfl 


II 

X 

UJ  »-* 

o 

z  o 

“•  *3 


ii  *7  -I  j  j  id 

J  J  _J  C7 

«-»  U.  <1  «  «  Z 

U  U  U  -• 


z 

X 
1 7 


2  - 
Id  .M 

X 


x 

u» 

II  Q 
v  z  - 

*d 

X  z 
who 
a 

2  h  O  CO 

••  H  H  K  2 

0X3 

z  u.  ►*  o  o 

U  h  J  :•)  "1 


UJ 

z 

o 


&  in  %o 
o  o  o 

*>  *7  K) 


*-<  <\/  rvj  ry  t\j  fv  Cu  <v 


CO  O'  O  «-l  (\»  *0  Cf  If) 
O  C  »m  «■* 

•O  Kl  KHO  IO  K)  IO  pO 


vs  r-  ao  0s 

H  H  rl  H 
•o  io  io  »o 


a 


r-i? 


this  procepure  calculates  the  category  percentages  for  unikount 


I 

o 

z 

O 

O 

lx 


V)  o 
O  M- 

o 

O  r-t 

U5 

II 

II 


O  O 
t-  3 


O 

<3 


Z 

U 

I 


O 

o 


►-  -  -J 

—  I-  1“  -I 
>-  •-  O  <X  — 
Of-l-UO 
O  H  \  H  J 

1$  ll.  o 

Ui  0  OX 

*-  UJ  II  H-  I 

4  0 £  k- 

O  U.  ►-  r-l  < 

Z  II  u 

II  II  Cl  -J 

K  JU.  - 

u  u.  a  *-<  o 
o  z 


—  m 

*  — 

z  u. 

<  • 

(X  —  — 


o  in 
o  n 


x 

o 


2 

a 

x 


u 

I 

o 

z 

o 

o 

u. 


X 

z> 
o 

a  — 

O  K- 

*>♦  Z 
o 
to  cr 
o  a 
c r 

4  II  »- 
U  Z 

I  *  z  u 
n  j  uj  a 
cr  j  X  x 
ui  •  • 

o  a  il 

Dh  • 

z  o  o 

u  a  ii  - 
to  t- 


-  2  O  -  IT  • 
HMDynHtf 
-5  0 

ii  c  r  -  uj  j  u 

D  >-»  <M  J  O  • 

H  1/5  Z  «  H  U  -> 

*  I  D  1  H  •  ® 

•-»  rvj  i  h  k  «  ^ 

z  k  k  it  1  in 

DUJZWK  -  J 
IQ.  h  4  lx  O 

z  tr  »-  u  —  o 

HUIQ.D 

(/)  X 


*n  o 

w  o 

x  - 


o  z 
z  cr 
ui  a 

o 

ui 


o 

h- 

p* 

1 

1 

2 

2 

(M  OJ  «0  IO 

OI 

Oj  fg 

<g  -» 

z 

2 

M 

eg 

> 

to 

^  ^  ^ 

*■  if  sf  * 

if 

3  3 

H 

ui 

r 

J 

M 

t“ 

►- 

o 

H  (V  TO  ^ 

in  VO  * 

O 

OJ  to 

X 

r 

Oj  OJ  <\j  <\ 

<Nl  (M  CVi  Oj 

eg 

to  to 

»0  to 

o 

►- 

tO 

tO  fO  tO  tO 

tO  tO  tO  tO 

to 

to  to 

to  to 

-J 

a 


F-13 


THIS  PROCEDURE  IS  THE  UMKOUNT  PROCEDURE  FOR  PERCENT  SUBJECTS 


399  <*  l  CALL  DO«N2  (FRE3. CATEGORY  ,CAT_T  I  TLF  .  ICAT  )  i 

390  “1  JO  I  :  I  Id  TCAT I 

391  9  P  ir  F'RLw(I)  =  0  THl\  r.^  TO  ['I'lMHj 
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fHI S  PROCEDURE  SORTS  arrays  J  S  K  in  ascending  order 
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